Validation bits and offsets to represent logical pages split between data containers

ABSTRACT

A flash memory codeword architecture is provided. A non-integer count of logical pages is packed into a codeword payload data container. A codeword payload header is generated. The codeword payload header includes an offset to a first logical page that is packed, at least in part, into the codeword payload data container. The codeword payload data container and the codeword payload header are concatenated to generate a codeword payload. Error-correcting code data is generated based, at least in part, on the codeword payload using a systematic error-correcting code. The codeword payload and error-correcting code data is concatenated to generate a codeword. A physical page is programmed with the codeword.

BACKGROUND OF THE INVENTION

The present invention relates generally to the field of data processingand storage, and more particularly to flash memory codewordarchitectures.

Memory devices are typically provided as internal integrated circuits incomputers and other electronic devices. There are many different typesof memory including volatile and non-volatile memory. Volatile memorytypically requires power to maintain data (e.g., user data orerror-correcting data). Non-volatile memory can provide persistent databy retaining stored data when not powered and can include NAND flashmemory, NOR flash memory, read only memory (ROM), and electricallyerasable programmable ROM (EEPROM), among various other types of memory.

Memory devices can be combined together to form a storage volume of amemory system such as a solid state drive (SSD). Solid state drivesgenerally include NAND flash memory and/or NOR flash memory andcontrollers that utilize smaller amounts of volatile memory (e.g.,dynamic random access memory) in the form of buffers and/or caches.Solid state drives incorporating NAND flash memory increasingly replaceor supplement hard disk drives (HDDs) in various computing devices dueto advantages in terms of performance, size, weight, ruggedness,operating temperature range, and power consumption. In general, SSDssignificantly outperform HDDs in terms of I/O performance due, at leastin part, to a lack of moving parts that eliminates seek times and otherelectro-mechanical delays that reduce HDD performance.

SUMMARY

According to one embodiment of the present disclosure, a method forprogramming flash memory is provided. The method includes packing, byone or more storage controllers, a non-integer count of logical pagesinto a codeword payload data container; generating, by one or morestorage controllers, a codeword payload header that includes an offsetto a first logical page packed, at least in part, into the codewordpayload data container; concatenating, by one or more storagecontrollers, the codeword payload data container and the codewordpayload header to generate a codeword payload; generating, by one ormore storage controllers, error-correcting code data based, at least inpart, on the codeword payload using a systematic error-correcting code;concatenating, by one or more storage controllers, the codeword payloadand error-correcting code data to generate a codeword; and programming,by one or more storage controllers, a physical page with the codeword.

According to another embodiment of the present disclosure, a computerprogram product for programming flash memory is provided. The computerprogram product comprises a computer readable storage medium and programinstructions stored on the computer readable storage medium. The programinstructions include program instructions to program instructions topack a non-integer count of logical pages into a codeword payload datacontainer; program instructions to generate a codeword payload headerthat includes an offset to a first logical page packed, at least inpart, into the codeword payload data container; program instructions toconcatenate the codeword payload data container and the codeword payloadheader to generate a codeword payload; program instructions to generateerror-correcting code data based, at least in part, on the codewordpayload using a systematic error-correcting code; program instructionsto concatenate the codeword payload and error-correcting code data togenerate a codeword; and program instructions to program a physical pagewith the codeword.

According to another embodiment of the present disclosure, a computersystem for programming flash memory is provided. The computer systemincludes one or more storage controllers, one or more computer readablestorage media, and program instructions stored on the computer readablestorage media for execution by at least one of the one or more storagecontrollers. The program instructions include program instructions toprogram instructions to pack a non-integer count of logical pages into acodeword payload data container; program instructions to generate acodeword payload header that includes an offset to a first logical pagepacked, at least in part, into the codeword payload data container;program instructions to concatenate the codeword payload data containerand the codeword payload header to generate a codeword payload; programinstructions to generate error-correcting code data based, at least inpart, on the codeword payload using a systematic error-correcting code;program instructions to concatenate the codeword payload anderror-correcting code data to generate a codeword; and programinstructions to program a physical page with the codeword.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a computer system, in accordance with anembodiment of the present disclosure.

FIG. 2 is a diagram that depicts a packing system for packing logicalpages into codeword payload data containers, in accordance with anembodiment of the present disclosure.

FIG. 3 is a flowchart depicting a method of programming physical pageswith codewords, in accordance with an embodiment of the presentdisclosure.

FIG. 4 is a block diagram that depicts the transformation of a codewordpayload data container into a codeword that is programmed into flashmemory, in accordance with an embodiment of the present disclosure.

FIGS. 5A and 5B are flowcharts that depict a method of accessing logicalpages that are associated with one or more codewords, in accordance withan embodiment of the present disclosure.

DETAILED DESCRIPTION

A logical page is a fixed-length contiguous block of virtual memory. Ingeneral, it is the smallest unit of data that has an address in virtualmemory. Virtual memory enables operating systems to utilize a virtualaddress space that is larger than the physical address space availablein backing computer storage media (e.g., random-access memory, flashmemory, or magnetic hard disk drives). Using the virtual address space,operating systems are able to provide processes with the illusion thatlarge, contiguous sections of the virtual address space are available tothe processes. In reality, data that corresponds to a contiguous rangeof virtual memory addresses can be stored at discontiguous physicaladdresses in memory and/or paged out to non-volatile computer storagemedia (e.g., flash memory and/or hard disk drives).

In a flash memory device, such as a SSD that includes NAND flash memory,data corresponding to virtual addresses is stored by programmingindividual cells. In NAND devices, cells are the smallest unit in anorganizational hierarchy that includes pages, blocks, planes, and dies.Flash memory devices typically include single-level cells, multi-levelcells, or triple-level cells. A single-level cell can store one bit ofdata, a multi-level cell can store two bits of data, and a triple-levelcell can store three bits of data. In NAND devices, cells are organizedinto pages of various sizes. Typical page sizes include five hundred andtwelve bytes, two kilobytes, four kilobytes, eight kilobytes, andsixteen kilobytes. In addition, a few bytes of data (e.g., 1/32 of thepage size) are generally associated with each NAND page for storage ofcontrol and error-correcting code (ECC) data. Similarly, pages areorganized into blocks of various sizes. Typical block sizes includethirty-two pages, sixty-four pages, and one hundred and twenty-eightpages of various sizes plus the associated control and ECC data.

In a NAND device, read operations are performed on a page basis, and itis not possible to read less than one page. Consequently, a readoperation generally returns more information than requested. Writeoperations are also performed on a page basis in NAND devices, and it isnot possible to write (i.e., program) less than a full page, even ifless than all bytes in the page are affected by the write operation.Unlike read and write operations, erasures are performed on a blockbasis. In NAND devices, it is not possible to erase less than one fullblock.

Embodiments of the present disclosure recognize that, in at least somecases, physical page sizes are larger than logical page sizes. Whilesome instruction set architectures support logical page sizes ofsixty-four kilobytes or more, four kilobytes (i.e., 4,096 bytes) is acommon default page size. Similarly, a physical page size of fourkilobytes is common. In general, it is beneficial to provision a systemsuch that logical pages and physical pages are of the same size. Due tosmaller manufacturing processing nodes and the introduction ofmulti-level cells and triple-level cells, however, larger physical pagesizes (e.g., page sizes of eight and sixteen kilobytes) are increasinglycommon. In addition, physical page sizes are increasingly a non-integermultiple of logical page sizes, and physical page sizes generally differbetween suppliers. In systems where physical page sizes are larger thanlogical page sizes, inefficiencies can exist. Embodiments of the presentdisclosure provide a flash memory architecture that uses codewords tosupport efficient use of physical page sizes that are larger thanlogical page sizes.

The present disclosure will now be described in detail with reference tothe Figures. FIG. 1 is a block diagram of computer system, in accordancewith an embodiment of the present disclosure. Computer system 100includes communications fabric 108, which provides communicationsbetween computer processor(s) 102, memory 104, persistent storage 110,communications unit 114, and input/output (I/O) interface(s) 112.Communications fabric 108 can be implemented with any architecturedesigned for passing data and/or control information between processors(such as microprocessors, communications and network processors, etc.),system memory, peripheral devices, and any other hardware componentswithin a system. For example, communications fabric 108 can beimplemented with one or more buses.

Memory 104 and persistent storage 110 are computer readable storagemedia. In various embodiments, memory 104 includes random access memory(RAM). In general, memory 104 can include any suitable volatile ornon-volatile computer readable storage media. Cache 106 is a fast memorythat enhances the performance of processors 102 by holding recentlyaccessed data and data near accessed data from memory 104.

Program instructions and data can be stored in persistent storage 110for execution by one or more of the processors 102 via cache 106 andmemory 104. Persistent storage 110 includes one or more logical volumesof NAND flash memory (i.e., flash volume(s) 120) and controller 122.Persistent storage 110 also stores storage logic 124. As describedherein, controller 122 executes storage logic 124 to, at least in part,manage flash volume(s) 120. In some embodiments, persistent storage 110includes a local cache (i.e., a cache that is separate from cache 106)that can store, among other things, instructions that enable controller122 to execute storage logic 124. In some embodiments, persistentstorage 110 is an SSD. In such embodiments, flash volume(s) 120represent one or more volumes of the SSD, controller 122 represents acontroller of the SSD, and storage logic 124 resides on controller 122(or is otherwise accessible to and executable by controller 122). Forexample, controller 122 may represent a processing unit of persistentstorage 110 that is communicatively connected to and controls flashvolume(s) 120. In other embodiments, persistent storage 110 includes aplurality of SSDs (e.g., a homogenous array of SSDs or a heterogeneousarray of SSDs and HDDs). In embodiments that include a plurality ofSSDs, one or more SSDs of the plurality can include an instance ofcontroller 122 that has an instance of and executes storage logic 124 tostore data that is allocated to the respective SSD. In yet otherembodiments, persistent storage 110 is a hybrid drive that includes oneor more logical volumes of flash memory (i.e., flash volumes 120) andone or more logical volumes that reside on hard disk(s).

Communications unit 114, in these examples, provides for communicationswith other data processing systems or devices. In these examples,communications unit 114 includes one or more network interface cards.Communications unit 114 may provide communications through the use ofeither or both physical and wireless communications links. Programinstructions and data used to practice embodiments of the presentinvention and storage logic 124 may be downloaded to persistent storage110 through communications unit 114.

I/O interface(s) 112 allows for input and output of data with otherdevices that may be connected to each computer system. For example, I/Ointerface 112 may provide a connection to external devices 116 such as akeyboard, keypad, a touch screen, and/or some other suitable inputdevice. External devices 116 can also include portable computer readablestorage media such as, for example, thumb drives, portable optical ormagnetic disks, and memory cards. Software and data used to practiceembodiments of the present invention can be stored on such portablecomputer readable storage media and can be loaded onto persistentstorage 110 via I/O interface(s) 112. I/O interface(s) 112 also connectto a display 118.

Display 118 provides a mechanism to display data to a user and may be,for example, a computer monitor.

FIG. 2 is a diagram that depicts a packing system for packing logicalpages into codeword payload data containers, in accordance with anembodiment of the present disclosure. For example, packing system 200 isa system in which codeword payload data containers can includenon-integer counts of logical pages. Codeword payload data containers202, 204, 206, 208, 210 represent a range of codeword payload datacontainers within one block of memory that includes a total of ncodeword payload data containers into which m logical pages are packed,as described herein. Logical pages 216, 218F/S, 220, 222F/S, 224,226F/S, 228, 248S, and 250 are examples of the m logical pages that arepacked into the range of n codeword payload data containers thatincludes codeword payload data containers 202, 204, 206, 208, and 210.The codeword payload data containers and the logical pages are sizedsuch that the codeword payload data containers are not divisible by thelogical pages. Consequently, logical pages that are not self-containedmust straddle at least some boundaries between codeword payload datacontainers. Packing system 200, however, prohibits logical pages fromstraddling the across the second boundary of the final codeword payloaddata container in a block of flash memory (i.e., logical pages cannotstraddle across blocks of memory). Accordingly, codeword payload datacontainer 210 includes unused space 212 because codeword payload datacontainer 210 is the final codeword payload container in the memoryblock. In the event that the number of logical pages in a block equalsone plus a second value, wherein the second value is a common multiplierof the size of a logical page and the size of a codeword payload datacontainer, then the first boundary of the last codeword payload datacontainer will align with the last logical page. In this circumstance,it is possible for the last codeword payload data container in a block(i.e., the last codeword in a block) to store an integer count oflogical pages to prevent straddling across a block boundary. Forexample, the last codeword payload data container in a block could storea single self-contained logical page that is aligned with the firstboundary of the codeword payload data container and include unused spacein the remaining portion of the codeword payload data container.

In the embodiment depicted in FIG. 2, the size of the logical pages is vbytes and the size of the codeword payload data containers is x bytes.In general, the logical pages and codeword payload data containers aresized such that the codeword payload data containers can include anon-integer count of logical pages. While the codeword payload datacontainers and logical pages are not drawn to scale in FIG. 2, codewordpayload data containers 202, 204, and 208 depict three ways in whichlogical pages can be arranged with respect to codeword payload datacontainer boundaries. In codeword payload data container 202, forexample, logical page 216 is aligned with the first boundary of thecontainer and does not straddle container boundaries (i.e., logical page216 is self-contained). The first portion of the subsequent logical page(i.e., logical page 218F) straddles out of codeword payload datacontainer 202 and across the boundary between codeword payload datacontainers 202 and 204. Consequently, the second portion of the logicalpage (i.e., logical page 218S) straddles into codeword payload datacontainer 204. In codeword payload data container 204, logical page 220is not aligned with either the first boundary or the second boundary ofthe container, but it does not straddle out of the container (i.e.,logical page 220 is self-contained). The subsequent logical page,however, straddles across codeword payload data containers 204 and 206.Accordingly, logical page 222F (i.e., the first portion of logical page222) is stored in the remainder of codeword payload data container 204,and codeword payload data container 206 includes logical page 222S(i.e., the second portion of logical page 222). Similarly, logical page224 is not aligned with either the first or second boundaries ofcodeword payload data container 206 but does not straddle acrosscontainer boundaries (i.e., logical page 224 is similarlyself-contained). Codeword payload data container 206 also includeslogical page 226F. Codeword payload data container 208 depicts anotherpossible alignment of logical pages and codeword payload datacontainers. In codeword payload data container 208, the second portionof logical page 226 (i.e., logical page 226S) straddles into codewordpayload data container 208 such that logical page 228 is aligned withthe second boundary of codeword payload data container 208 and does notstraddle out of the container (i.e., logical page 228 isself-contained). In addition to the arrangements of logical pagesdepicted in FIG. 2, it is also possible that a codeword payload datacontainer includes a second portion of a first logical page and a firstportion of a second logical page such that no logical page isself-contained within the codeword payload data container. The ratio oflogical page size to codeword payload data container size determines, atleast in part, how logical pages are arranged with respect to codewordpayload data container boundaries.

In FIG. 2, codeword payload data container 210 represents the finalcodeword payload data container within a block of flash memory. Becauseflash memory is typically erased in blocks, logical pages are notpermitted to straddle across block boundaries. In the embodimentdepicted in FIG. 2, the size ratio of logical pages to codeword payloaddata containers, the number of codeword per physical page, and thenumber of physical pages per block of memory is such that codewordpayload data container 210 includes unused space 212, in addition tological pages 248S and 250, to prevent straddling across blockboundaries. In other embodiments, however, the size ratio, the number ofcodewords per physical page, and the number of physical pages per blockis such that logical pages align with the block boundaries such thatunused space is not required to prevent straddling across blockboundaries.

FIG. 3 is a flowchart depicting a method of programming physical pageswith codewords, in accordance with an embodiment of the presentdisclosure. Storage logic 124 includes method 300. Controller 122executes storage logic 124 to perform method 300 and thereby programpages in flash volume(s) 120 with codewords, as described herein.

Method 300 includes packing a non-integer count of logical pages into acodeword payload data container (305), as described herein with respectto FIG. 2. Based, at least in part, on how the logical pages are packedinto the codeword payload data container, one or more offset values arecalculated. In general, an offset value indicates the position of thefirst memory cell of a logical page in relation to the first boundary ofthe codeword payload data container. The offset value also defines theboundary between two logical pages in a codeword payload data container.In some embodiments, the offset value is a value that describes thememory cell that stores the first bit of the first logical page thatdoes not straddle into the codeword payload data container (i.e., aself-contained logical page or a logical page that straddles out of thecodeword payload data container). If, for example, the offset value iszero, the first logical page in the codeword payload data container isaligned with the first boundary of the codeword payload data container(e.g., logical page 216 in FIG. 2). If the offset value is non-zero, thelogical page can be self-contained within the codeword payload datacontainer or straddle out of the codeword payload data container.Depending on the non-zero offset value, the size of the logical pages,and the size of the codeword payload data container, the self-containedlogical page can be aligned with second boundary of the logical page orallow an integer or non-integer count of additional logical pages to bestored in the remainder of the codeword payload data container. Forexample, the additional logical pages could be a portion of a logicalpage that straddles out of the codeword payload data container, one ormore self-contained logical pages, or one or more self-contained logicalpages and a portion of a logical page that straddles out of the codewordpayload data container. Using the offset value and the size of a logicalpage, it is possible to determine the offset to any logical page thatdoes not straddle into the codeword payload data container. Accordingly,only the offset value to the first logical page that does not straddleinto the codeword payload data container is calculated in someembodiments. In other embodiments, however, offset values to any pagethat is self-contained or that straddles out of the codeword payloaddata container is calculated.

Based, at least in part, on the offset value(s) described above, acodeword payload header is generated (310) that includes the offsetvalue(s). In some embodiments, the codeword payload header also includesthe physical address of the physical page in flash memory that will beprogrammed with the codeword produced by method 300. In suchembodiments, the physical address in the codeword payload header can becompared to the physical address referenced in read commands in order toprovide a validation check.

Codeword payload data containers are concatenated with codeword payloadheaders to generate a codeword payload (315). The codeword payload issystematically encoded to generate a codeword (320). By using systematiccodes, the codeword payload is embedded in the output. Systematic codesare advantageous in this application because the error-correcting code(ECC) data can be concatenated with the codeword payload to generate thecodeword (325). In addition, the offset value(s) in the codeword payloadheader are preserved. The codeword payload can be encoded usingsystematic Hamming codes, systematic Reed-Solomon codes, or anothersystematic error-correcting code known in the art. While the specificcode used to generate the ECC data is not a limitation of the presentdisclosure, the size of the resulting ECC data can influence, amongother things, the size of the codeword payload data containers andcodeword payload headers.

To store the codeword in persistent storage 110, controller 122 programsa page in flash volume(s) 120 with the codeword (330). Depending on thesize of the codeword and the size of the page in flash volume(s) 120,two or more codewords can be programed into the page.

In embodiments like the one depicted in FIG. 2, the codeword payloaddata containers and the logical pages are sized such that no more thanone logical page can be self-contained within each codeword payload datacontainer. As described herein, five scenarios exist that describe howlogical pages can be arranged in such embodiments. In some embodiments,a set of three binary validation bits 214, wherein a programmed bit isrepresented by a zero, describes the five scenarios. For example, thescenario in which a logical page is aligned with the first boundary of acodeword payload data container (e.g., logical page 216) can berepresented by a set of three bits 214 having values of [011] (i.e., asingle zero indicates a self-contained logical page). Similarly, thescenario in which a logical page is aligned with the second boundary ofa codeword payload data container (e.g., logical page 228) can berepresented by a set of three bits 214 having values of [110]. Thescenario in which a logical page is self-contained in the codewordpayload data container but not aligned with a boundary can berepresented by a set of three bits 214 having values of [101] (i.e., thefirst bit is associated with a first boundary of a logical page and thethird bit is associated with the second boundary of the logical page).As described herein, it is also possible that a logical page straddlestwo codeword payload data containers. In this scenario, two codewordsand two sets of validation bits 214 are needed to describe thearrangement of the two portions of the logical page. For example, a setof three bits 214 having values of [001] can represent the portion ofthe logical page that straddles out of a first codeword payload datacontainer (i.e., a first portion of the logical page). Similarly, a setof three bits 214 having values of [100] can represent the portion ofthe logical page that straddles into a second codeword payload datacontainer (i.e., a second portion of the logical page). In theserepresentations, the two programmed bits (i.e., the two zeros) indicatestraddling and the position of the unaltered bit (i.e., the one)indicates whether the logical page straddles into or out of thecorresponding codeword.

In other embodiments, a set of two binary validation bits describes fourof the five scenarios. In one example of a two-bit system, a set of twobits having values of [00] represents three scenarios in which a logicalpage is self-contained. By comparing the offset to the sizes of thelogical pages and/or codeword payload data containers, it is possible todetermine whether the self-contained page is (1) aligned with the firstboundary of the codeword payload data container (e.g., the offset tological page 218F in FIG. 2 is V+1 bytes), (2) aligned with the secondboundary of the codeword payload data container (e.g., the offset tological page 228 is Y-V bytes), or (3) aligned with neither the firstnor second boundary of the codeword payload data container (e.g., theoffset of logical page 220 is less than Y-V bytes). In this example of atwo-bit system, a set of two bits having values [10] represents thescenario in which a logical page straddles into a codeword payload datacontainer, and a set of two bits having values of [01] represents thescenario in which a logical page straddles out of a codeword payloaddata container. Reading a logical page that straddles codeword payloaddata containers involves a reference to more than one codeword, asdescribed herein. Based, at least in part, on the referenced codewords(e.g., the order of the codewords), controller 122 executes storagelogic 124 to determine the size of the first portion of the logical page(e.g., logical page 218F) and the size of the second portion of thelogical page (e.g., logical page 218S) based, at least in part, on theoffsets associated with the two codewords (e.g., the codewords thatinclude codeword payload data containers 202 and 204).

In another example of a two-bit system: (1) a set of two bits havingvalues of [00] represents the scenario in which a self-contained logicalpage is aligned with neither the first nor second boundaries of acodeword payload data container; (2) sets of two bits having values of[01] and [10] respectively represent the scenarios in which aself-contained logical page is aligned with the first and secondboundaries of a codeword payload data container; and (3) a set of twobits having values of [11] represents the scenarios in which a logicalpage straddles across the boundaries of codeword payload datacontainers. Based, at least in part, on the referenced codewords (e.g.,the order of the codewords) and the associated offsets, controller 122executes storage logic 124 to determine the location and size of a firstportion of the logical page (e.g., logical page 218F) and the locationand size of a second portion of the logical page (e.g., logical page218S). This example of a two-bit system, as well as the three-bit systemdescribed herein, advantageously enable the recovery of at least somelogical pages in situations where the offset values are corrupted orotherwise unavailable. For example, logical pages that align with afirst boundary of a codeword payload data container (e.g., logical pagesassociated with validation bit sets [01] or [011]) can be recoveredusing the validation bits and knowledge of the size of a logical page.Similarly, logical pages that align with a second boundary of a codewordpayload data container (e.g., logical pages associated with validationbit sets [10] or [110]) can be recovered using the validation bits andknowledge of the sizes of a logical page and a codeword payload datacontainer. In situations where a logical page straddles boundariesbetween codeword payload containers, it is possible to extract both thefirst and second portions of the logical page if the offset associatedwith one of the two respective codewords is available.

The three validation bit systems described herein are examples ofvarious validation bit systems, and permutations of the validation bitsystems described herein are possible without departing from the scopeof the present disclosure. In addition, more validation bits can beadded to support codeword payload data containers that can include twoor more self-contained logical pages (e.g., a set of four validationbits to describe codeword payload data containers that can include twoself-contained logical pages).

In addition to programming page(s) in flash volume(s) 120, controller122 manages one or more data repositories that associate each logicalpage stored in flash volume(s) 120 with the page(s) on which they arestored, the codeword(s) that describe them, and the set(s) of validationbits that describe how the logical pages are arranged in thecodeword(s). For example, controller 122 can manage one or more of aflash translation layer, a page-level map, a block-level map, alog-block map or another mapping system that computer system 100 and/orpersistent storage 110 can use to access logical pages stored in flashvolume(s) 120.

FIG. 4 is a block diagram that depicts the transformation of a codewordpayload data container into a codeword that is programmed into flashmemory, in accordance with an embodiment of the present disclosure.Transformation 400 depicts the results of various operations of method300 as applied to the embodiment depicted in FIG. 2.

In FIG. 4, codeword payload 402 is depicted as a codeword payload datacontainer of x bytes that is concatenated with a codeword payload headerof y bytes. As shown in FIG. 4, codeword payload 402 has a size of x+ybytes. Encoding (320) codeword payload 402 generates z bytes of ECCdata. Accordingly, codeword payload 402 and the ECC data areconcatenated to generate codeword 412 that has a size of x+y+z bytes.Codeword 412 is programmed (330) into physical page 416. In theembodiment depicted in FIG. 4, physical page 416 has a size of 2(x+y+z)bytes and stores codeword 412 and codeword 414. Like codeword 412,codeword 414 has a size of x+y+z bytes and is generated using adifferent instance of method 300 that operates on a different codewordpayload data container.

FIGS. 5A and 5B are flowcharts that depict a method of accessing logicalpages that are associated with one or more codewords, in accordance withan embodiment of the present disclosure. Storage logic 124 includesmethod 500, which controller 122 executes to access (i.e., read) variouslogical pages in flash volume(s) 120. In the embodiment depicted inFIGS. 5A and 5B, the codewords (i.e., the codeword payload datacontainers) and logical pages are sized such that a codeword caninclude, at most, one self-contained logical page.

In response to receiving a command to read a logical page (502) in flashvolume(s) 120, controller 122 determines if the read command involvesmultiple codewords (504). In various embodiments, controller 122determines if the read command involves multiple codewords by searchingone or more data repositories such as a flash translation layer, apage-level map, a block-level map, a log-block map or another mappingsystem for the logical page. If the read command involves multiplecodewords, the logical page will be associated with more than onecodeword (and corresponding pages in flash volume(s) 120) and more thanone corresponding set(s) of validation bits in the aforementionedrepositories.

If controller 122 determines that the logical page is associated with asingle codeword (decision 504, NO branch), controller 122 accesses thecodeword (506) on a mapped page of flash volume(s) 120. In theembodiment depicted in FIG. 5A, controller 122 accesses the codewordpayload header to determine the offset to the logical page (508).Because the read involves a single codeword, the offset will identifythe beginning of the logical page that is referenced in the read commandbecause the logical page is necessarily self-contained in the codewordin this embodiment. Using the offset and the size of a complete logicalpage, controller 122 extracts the self-contained logical page from thecode word (510) and communicates the logical page to processors (102)and/or another computing device that is connected to computer system 100(512).

If controller 122 determines that the logical page is associated withmultiple codewords (decision 504, YES branch), controller 122 accessesthe first codeword. In the embodiment depicted in FIGS. 5A and 5B, thefirst codeword includes the first portion of the logical page referencedin the read command. In order to calculate the offset to the firstportion of the logical page, controller 122 access the codeword payloadheader. Using the offset, controller 122 determines whether the offsetalone or the offset plus the logical page size identifies the firstportion of the logical page (516). The offset alone will identify thefirst portion of the logical page if the offset plus the logical pagesize exceeds the logical page size. Based, at least in part, on thedetermination of the offset to the first portion of the logical page(i.e., the offset stored in the codeword payload header with or withoutadding the size of logical page), controller 122 extracts the firstportion of the logical page from the first codeword (518) and stores thefirst portion of the logical page in a cache or buffer that is includedin persistent storage 110 (520). If persistent storage 110 does notinclude a cache or buffer, controller 122 interacts with processor(s)102 or another component of computer system 100 to facilitate storage ofthe first portion of the logical page in memory 104 (i.e., main/systemmemory).

If a logical page straddles two codewords, it is possible that thelogical page straddles two pages in flash volume(s) 120. In general,there is no guarantee that the process of accessing a first page inflash volume(s) 120 and accessing a second page in flash volume(s) willcomplete in order. Consequently, reading a logical page that straddlespages in flash volume(s) 120 can incur significantly more latency thanreading a logical page that straddles two codewords of a single page inflash volume(s) 120. After requesting the second codeword (522),controller 122 determines if a time threshold has been exceeded prior toreceiving the second codeword from flash volume(s) 120 (524). The secondcodeword is the codeword that includes a second portion of the logicalpage. The second portion of the logical page is the portion of thelogical page that straddles into a codeword. In some embodiments, thetime threshold is set such that times greater than the threshold requireholding the first portion of the logical page in cache for anunacceptable amount of time. In other embodiments, the time threshold isset based, at least in part, on other parameters that determine whetheror not latencies and/or resource utilization is unacceptable. In someembodiments where latencies and/or resource utilization is not importantin this context, operations relating and subsequent to thisdetermination (i.e., decision 524, YES branch) are omitted. Ifcontroller 122 determines that the threshold is not exceeded (decision524, NO branch), controller 122 determines whether or not it hasreceived the second codeword from flash volume(s) 120. If controller 122determines that is has not received the second codeword (decision 526,NO branch), controller 122 will determine whether or not the thresholdis exceeded (524) and whether or not it has received the second codewordfrom flash volumes(s) 120 until the threshold is exceeded or controller122 receives the second codeword.

If controller 122 determines that it has received the second codewordfrom flash volume(s) 120 (decision 526, YES branch), controller 122extracts the second portion of the logical page (528). Controller 122extracts the second portion of the logical page by accessing thecodeword payload header of the second codeword to obtain the offset tothe first self-contained logical page or to the boundary between thesecond portion of the logical page and a logical page that straddles oris aligned with the second boundary of the second codeword. Using theoffset, controller 122 derives the size of the second portion of thelogical page. Using the first and second portions of the logical page,controller 122 reconstructs the logical page (530) and communicates thelogical page to processors (102) and/or another computing device that isconnected to computer system 100 (512).

If controller 122 determines that the threshold has been exceeded(decision 524, YES branch), controller 122 generates a tag for the firstportion of the logical page (532). The tag is associated the uncompletedread operation and is used to distinguish the uncompleted read operationfrom subsequent read operations to the same logical page. In someembodiments, the tag is a time stamp. In other embodiments, the tag isanother identifier that is used to distinguish the uncompleted readoperation. Controller 122 stores the first portion of the logical pageand a copy of the tag for later use (534). In embodiments wherepersistent storage 110 includes a buffer of volatile memory (e.g., aDRAM buffer) that is not also a cache for controller 122, controller 122stores the first portion of the logical page and the tag in the buffer.In other embodiments, controller 122 interacts with processor(s) 102 oranother component of computer system 100 to facilitate storage of thefirst portion of the logical page in memory 104 (i.e., main/systemmemory). Controller 122 retains, in cache, a copy of the tag and thephysical or logical address where the first portion of the logical pageis stored (536). To, at least in part, free resources for other readoperations, garbage collection, or other processes in persistent storage110, controller 122 deletes the first portion of the logical page fromcache (538).

After receiving the second codeword (540), controller 122 determines theoffset created by the second portion of the logical page (542). Todetermine the offset, controller 122 accesses the codeword payloadheader of the second codeword to obtain the offset to the firstself-contained logical page or to the boundary between the secondportion of the logical page and a logical page that straddles or thesecond boundary of the second codeword. Using the offset, controller 122derives the size of the second portion of the logical page and extractsthe second portion of the logical page from the second codeword (544).To reconstruct the logical page, controller 122 retrieves the firstportion of the logical page and the tag (546) using the address storedin cache. To ensure that retrieved first portion of the logical page isassociated with the read operation that retrieved the second portion ofthe logical page, controller determines whether or not the retrieved tagand the cached tag match (548). If the controller 122 determines thatthe tags do not match (decision 548, NO branch), controller retrievesportions of logical pages and corresponding tags until controller 122determines that the tags match. In some embodiments, controller 122 canretrieve any portions of any logical pages that are stored in the datarepository from which the first portion of the logical page wasinitially retrieved. In other embodiments, the tag or other information(e.g., the codeword, logical page address, or the address of associatedpage in flash volume(s) 120) is stored with the first portion of thelogical page before it is deleted from cache (538) to, at least in part,reduce the latency of the read operation. If controller 122 determinesthat the tags match, controller 122 reconstructs the logical page usingthe first and second portions of the logical page (530) and communicatesthe logical page to processors (102) and/or another computing devicethat is connected to computer system 100 (512).

In addition, controller 122 is capable of reading physical pages fromflash volume(s) 120 directly in some embodiments (i.e., reading aphysical page without reading a logical page). In this scenario,controller 122 reads all data contained in page(s) in flash volume(s)120 and ignores any boundaries between logical pages. In someembodiment, controller 122 can perform physical reads of individualcodeword instead of reading an entire page. Physical reads, as opposedto logical reads, can allow for restoration of data in some situationsand help to determine the reliability of flash volume(s) 120.

The present invention may be a system, a method, and/or a computerprogram product. The computer program product may include a computerreadable storage medium (or media) having computer readable programinstructions thereon for causing a processor to carry out aspects of thepresent invention.

The computer readable storage medium can be a tangible device that canretain and store instructions for use by an instruction executiondevice. The computer readable storage medium may be, for example, but isnot limited to, an electronic storage device, a magnetic storage device,an optical storage device, an electromagnetic storage device, asemiconductor storage device, or any suitable combination of theforegoing. A non-exhaustive list of more specific examples of thecomputer readable storage medium includes the following: a portablecomputer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a static random access memory (SRAM), a portablecompact disc read-only memory (CD-ROM), a digital versatile disk (DVD),a memory stick, a floppy disk, a mechanically encoded device such aspunch-cards or raised structures in a groove having instructionsrecorded thereon, and any suitable combination of the foregoing. Acomputer readable storage medium, as used herein, is not to be construedas being transitory signals per se, such as radio waves or other freelypropagating electromagnetic waves, electromagnetic waves propagatingthrough a waveguide or other transmission media (e.g., light pulsespassing through a fiber-optic cable), or electrical signals transmittedthrough a wire.

Computer readable program instructions described herein can bedownloaded to respective computing/processing devices from a computerreadable storage medium or to an external computer or external storagedevice via a network, for example, the Internet, a local area network, awide area network and/or a wireless network. The network may comprisecopper transmission cables, optical transmission fibers, wirelesstransmission, routers, firewalls, switches, gateway computers and/oredge servers. A network adapter card or network interface in eachcomputing/processing device receives computer readable programinstructions from the network and forwards the computer readable programinstructions for storage in a computer readable storage medium withinthe respective computing/processing device.

Computer readable program instructions for carrying out operations ofthe present invention may be assembler instructions,instruction-set-architecture (ISA) instructions, machine instructions,machine dependent instructions, microcode, firmware instructions,state-setting data, or either source code or object code written in anycombination of one or more programming languages, including an objectoriented programming language such as Smalltalk, C++ or the like, andconventional procedural programming languages, such as the “C”programming language or similar programming languages. The computerreadable program instructions may execute entirely on the user'scomputer, partly on the user's computer, as a stand-alone softwarepackage, partly on the user's computer and partly on a remote computeror entirely on the remote computer or server. In the latter scenario,the remote computer may be connected to the user's computer through anytype of network, including a local area network (LAN) or a wide areanetwork (WAN), or the connection may be made to an external computer(for example, through the Internet using an Internet Service Provider).In some embodiments, electronic circuitry including, for example,programmable logic circuitry, field-programmable gate arrays (FPGA), orprogrammable logic arrays (PLA) may execute the computer readableprogram instructions by utilizing state information of the computerreadable program instructions to personalize the electronic circuitry,in order to perform aspects of the present invention.

Aspects of the present invention are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems), and computer program products according to embodiments of theinvention. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer readable program instructions.

These computer readable program instructions may be provided to aprocessor of a general purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. These computer readable program instructionsmay also be stored in a computer readable storage medium that can directa computer, a programmable data processing apparatus, and/or otherdevices to function in a particular manner, such that the computerreadable storage medium having instructions stored therein comprises anarticle of manufacture including instructions which implement aspects ofthe function/act specified in the flowchart and/or block diagram blockor blocks.

The computer readable program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other deviceto cause a series of operational steps to be performed on the computer,other programmable apparatus or other device to produce a computerimplemented process, such that the instructions which execute on thecomputer, other programmable apparatus, or other device implement thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The flowchart and block diagrams in the Figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the present invention. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof instructions, which comprises one or more executable instructions forimplementing the specified logical function(s). In some alternativeimplementations, the functions noted in the block may occur out of theorder noted in the Figures. For example, two blocks shown in successionmay, in fact, be executed substantially concurrently, or the blocks maysometimes be executed in the reverse order, depending upon thefunctionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts or carry out combinations of special purpose hardwareand computer instructions.

The term(s) “Smalltalk” and the like may be subject to trademark rightsin various jurisdictions throughout the world and are used here only inreference to the products or services properly denominated by the marksto the extent that such trademark rights may exist.

The descriptions of the various embodiments of the present inventionhave been presented for purposes of illustration, but are not intendedto be exhaustive or limited to the embodiments disclosed. Manymodifications and variations will be apparent to those of ordinary skillin the art without departing from the scope and spirit of the invention.The terminology used herein was chosen to best explain the principles ofthe embodiment, the practical application or technical improvement overtechnologies found in the marketplace, or to enable others of ordinaryskill in the art to understand the embodiments disclosed herein.

What is claimed is:
 1. A computer program product for programming flashmemory, the computer program product comprising: a non-transitorycomputer readable storage medium and program instructions stored on thenon-transitory computer readable storage medium, the programinstructions comprising: program instructions to pack a non-integercount of logical pages into a first codeword payload data container suchthat the codeword payload data container contains (i) a first logicalpage that does not straddle boundaries of the first codeword payloaddata container and (ii) a first portion of a second logical page,wherein a codeword payload data container size is greater than a size ofa single logical page and less than a size of two logical pages; programinstructions to pack a second portion of the second logical page into asecond codeword payload data container, the second logical page therebystraddling a boundary of the first codeword payload data container and aboundary of the second codeword payload data container; programinstructions to associate a first set of three validation bits with thefirst logical page, wherein the first set of three validation bitsdescribes an arrangement of the first logical page in the first codewordpayload data container using a programmed validation bit that indicatesthat the first logical page is a self-contained logical page; programinstructions to associate a second set of three validation bits with thefirst portion of the second logical page, wherein the second set ofthree validation bits describes an arrangement of the first portion ofthe second logical page in the first codeword payload data containerusing a first pair of consecutive, programmed validation bits, andwherein the first pair of consecutive, programmed validation bits arearranged in the second set of three validation bits such that the secondset of three validation bits indicates that the first portion of thesecond logical page straddles out of the first codeword payload datacontainer; program instructions to associate a third set of threevalidation bits with the second portion of the second logical page,wherein the third set of three validation bits describes an arrangementof the second portion of the second logical page in the second codewordpayload data container using a second pair of consecutive, programmedvalidation bits, and wherein the second pair of consecutive, programmedvalidation bits are arranged in the third set of three validation bitssuch that the third set of three validation bits indicates that thesecond portion of the second logical page straddles into the secondcodeword payload data container; program instructions to generate acodeword payload header that includes data describing an offset to thefirst logical page in the first codeword payload data container; programinstructions to concatenate the first codeword payload data containerand the codeword payload header to generate a codeword payload; programinstructions to generate error-correcting code data based, at least inpart, on the codeword payload using a systematic error-correcting code;program instructions to concatenate the codeword payload anderror-correcting code data to generate a codeword; and programinstructions to program a physical page with the codeword, wherein thephysical page stores a non-zero integer count of codewords.